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1, INTRODUCTION 

High Neutral to Earth Voltage (NTEV) in the commercial building is one of the concerns in Power 
Quality (PQ) disturbances due to hazardous to the humans, animals, electric and electronic appliances, and 
electrical networks system [1],[2]. Many researchers have discussed the high NTEV in the commercial 
building is due to improper wiring, poor grounding system, nonlinear load, and cable damaged [3-6]. 
However, the discussion is still in assumption to determine the source types of NTEV as in [7] and have not 
been proved by any artificial intelligent techniques. 

Normally, the source types of PQ disturbance in the commercial building can be identified by using 
the classification technique [8]. Nowadays, a lot of classifier tools are combined with the signal processing 
technique to classify the PQ disturbances in the commercial building [9-11]. The signal processing is utilized 
to extract the desired information which is applied as an input for the classifier tools [8]. Then, some of the 
disturbances introduce other problems in signal processing due to limitation and cannot be analyzed to 
extract the desired information accurately [12],[13]. Hence, an appropriate variable in classifier tool and 
signal processing technique need to be utilized to identify the types of NTEV problem in the commercial 
building. The NTEV occurrence needs to be aware of due to hazardous to humans, animals, electrical 
appliances, and electrical network system [14]. 
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The main contribution of this paper is to classify the source types of NTEV problems in the 
commercial building due to the triplen harmonic and transient. The transient can be divided into two 
categories; transient due to the loose termination and transient due to the lightning strike. Then, the K- 
Nearest Neighbor (K-NN) is selected as the classifier tool and has been tested with different variables, which 
are Euclidean, City block, and Chebyshev. The different variables of K-NN are utilized to identify the best 
performance in classifying the NTEV problem. Further, S-Transform (ST) is chosen to extract the features of 
NTEV problem according to the statistical analysis algorithms. ST technique is selected due to the capability 
in processing the signal without missing any information features, variables window scale, and outperform 
wavelet transform (WT) and short-time fourier transform (STFT). Finally, the performances of K-NN are 
observed based on the different variables by using confusion matrix and linear regression. According to the 
confusion matrix and linear regression results, the performance of K-NN classifier tools is analysed based on 
the percentages of accuracy, misclassification and the relationship between the target and output results. 


2. RESEARCH METHOD 
To classify the NTEV problems in the commercial building, several methods such as the ST 
operation, K-NN classifier tool should be followed 


2.1. The S-Transform (ST) Theory 

ST technique is a time-frequency pre-processing signal, which presents the result in complex 
number spectrum. Then, ST also produces the result in S-matrix which consist the multiple numbers of 
columns and rows. The ST of signal x(t) can be defined as follow [13]: 
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where, 

k, m, n=0, 1, ..., N-1 
T=sampling interval 
N=total of sampling point 


According to the ST result, the features of NTEV problem are extracted by using the statistical 
analysis technique, that include the standard deviation (3), mean (4), variance (5), skewness (6), kurtosis (7), 
and total harmonic distortion (THD) (8). Figure 1 shows the operation to classify the NTEV problem in the 
commercial building included ST, statistical analysis, and K-NN classifier tools. Then, the algorithms of 
statistical analysis are given as [15]: 
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Figure 1. The Classification of NTEV Problem 


Based on the equation in (3)-(8), the result of NTEV features can be obtained and utilized as an input for K- 
NN classifier. 


2.2. K-Nearest Neighbor (K-NN) 

K-NN can be categorized as the Nearest Neighbor (NN) family [16]. K-NN is reliable and easy to 
implement for classification by using non-parametric and lazy learning technique [17]. Basically, the K-NN 
classification operates based on the distance metric of input classifying and testing samples. Then, the 
performance of K-NN also depends on the selected number of K. Thus, the number of K 1s selected based on 
the best values before analyzing the performance of K-NN using the different variables. This paper shows 
the testing for different variables of K-NN in order to identify the best performance of K-NN in classifying 
the NTEV problem. The variables of K-NN such as the Euclidean (9), City block (10), Chebyshev (11) are 
used to analyze the NTEV problems in the commercial building. The mathematical algorithms are given 
below [18]: 


[LG —y,) (9) 


k 
> |4-y | (10) 
i=] 


Indonesian J Elec Eng & Comp Sci, Vol. 9, No. 2, February 2018 : 380 — 386 


Indonesian J Elec Eng & Comp Sci ISSN: 2502-4752 O 383 


y, -y,]) (11) 








max (\x., =X; 


9 


3. RESULTS AND ANALYSIS 

Figure 2 shows the input of classification which utilized on different variables of K-NN classifier 
tools. The number of samples that have been used as input and testing data are 124 and 126, respectively. 
Based on the samples data, K-NN analysis is utilized to identify the best categories of K-NN variables in 
classifying the NTEV problems. 
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Figure 2: The plot classification input features: (a) Fl and F2, (b) F3 and F4, and (c) F5 and F8 


3.1. Performance of K-NN Variables Based on Different K Values 

The K-NN classification uses 50 iterations with a different number of K. 

Figure 3 shows each different values of K produce the different results of K-NN variables. The best 
values of K for variables Euclidean, City block, and Chebyshev are between | until 4. The number of K 
between | until 4 shows the identical results with respect to their performance. Based on the best value of K, 
the K-NN is analyzed by using the confusion matrix technique and linear regression. 
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Figure 3 Performance of K value for K-NN types (a) Euclidean, (b) Cityblock, (c) Chebyshev 


3.2. Confusion Matrix Results 

Figure 4 shows the result of confusion matrix which has been tested by using the K-NN classifier of 
the different variables such as Euclidean, City block, and Chebyshev. The classes 1, 2, and 3 represents the 
NTEV problems due to the harmonic, loose connection, and lightning, respectively. Based on the figure, the 
numbers of samples for class 1, class 2, and class 3 are identical 42. The result of variable Euclidean shows 
that the class 1, class 2, and class 3 are 33.3%. The result of K-NN for variable City block seems similar with 
variable Euclidean. Then for the variable Chebyshev shows the results for class 1, class 2, and class 3 are 
33.3%, 31%, and 33.3%, respectively. The correct number classifies for variable Chebyshev are 42, 39, and 
42 for class 1, class 2, and class 3, respectively. 

Overall results show that the K-NN classifier using the Euclidean and City block variables are the 
best results on classifying the NTEV problems with 100% accuracy. 
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Figure 4. The K-NN Confusion Matrix for variable (a) Euclidean (b) City block (c) Chebyshev 
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3.3. Regression Results 

Table 1 elaborates the performance of K-NN based on the different variables by using linear 
regression. According to the table, the K-NN results for variables Euclidean and City block are 100% for all 
classes. That means the relationship between the target and the output are linear proportional. However, the 
variable Chebyshev shows the results for class 1 and class 3 are hundred percent linear. Then, the variable 
Chebyshev shows the class 2 result is 94.69% can be explained by the linear regression. The total results of 
K-NN are 100%, 100%, and 98.23% for variables Euclidean, City block, and Chebyshev, respectively. The 
variables Euclidean and City block show the results are similar to each other and represent as the best 
performance in analyzing the problems of NTEV due to the harmonic, loose termination, and lightning in the 
commercial building. However, the Chebyshev result is still dependable, since the relationship between the 
target and the output are close to the hundred percent. 


Table 1. Performance of K-NN based on the different variables 


Variables Class 1 (%) Class 2 (%) Class 3 (%) Total (%) 
Euclidean 100 100 100 100 
City block 100 100 100 100 
Chebyshev 100 94.69 100 98.23 


4. CONCLUSION 

This paper is presented to classify the NTEV problem in the commercial building by using the K- 
NN classifier tools. The K-NN classifier tool uses the different continuous variables, which are Euclidean, 
City block, and Chebyshev. The ST is applied as the proposed technique to extract the features of NTEV 
problem, which is then utilized as classification input. The performance of K-NN classifier with different 
variables is observed based on the confusion matrix and linear regression results. The Euclidean and City 
block variables presented the best result to classify the NTEV problems in the commercial building compared 
to the Chebyshev variable. Even though the performance of the Chebyshev variable is lower compared to the 
other variables, it is still reliable as the confusion matrix shows minor misclassification which is 2.4%. Then, 
the linear regression shows that the percentage of Chebyshev result close to perfect value which is hundred 
percent. 
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